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Abstract — In this paper, we are interested in improving the 
performance of constructive network coding schemes in lossy 
wireless environments. We propose I 2 NC - a cross-layer approach 
that combines inter-session and intra-session network coding 
and has two strengths. First, the error-correcting capabilities 
of intra-session network coding make our scheme resilient to 
loss. Second, redundancy allows intermediate nodes to operate 
without knowledge of the decoding buffers of their neighbors. 
Based only on the knowledge of the loss rates on the direct 
and overhearing links, intermediate nodes can make decisions 
for both intra-session (i.e., how much redundancy to add in 
each flow) and inter-session (i.e., what percentage of flows to 
code together) coding. Our approach is grounded on a network 
utility maximization (NUM) formulation of the problem. We 
propose two practical schemes, I 2 NC-state and I NC-stateless, 
which mimic the structure of the NUM optimal solution. We also 
address the interaction of our approach with the transport layer. 
We demonstrate the benefits of our schemes through simulations. 

Index Terms — Network coding, wireless networks, error cor- 
rection, cross-layer optimization. 



I. Introduction 

Wireless environments lend themselves naturally to network 
coding (NC), thanks to their inherent broadcast and overhear- 
ing capabilities. In this paper, we are interested in wireless 
mesh networks used for carrying traffic from unicast sessions, 
which is the dominant traffic today. Network coding has been 
used as a way to improve throughput over such wireless 
environments. Given that optimal inter-session NC for unicast 
is still an open problem, constructive approaches are used in 
practice Q], (2), Q), ID, (5). One of the first practical wireless 
NC systems is COPE Q - a coding shim between the IP and 
MAC layers that performs one-hop, opportunistic NC. COPE 
codes packets from different unicast sessions, and relies on 
receivers being able to decode these using overheard packets. 
This way, COPE combines multiple packets by using infor- 
mation on overheard packets which are exchanged through 
transmission reports and effectively forwards multiple packets 
in a single transmission to improve throughput. In order for 
COPE to work in a multihop network, nodes must cooperate 
to (i) exchange information about what packets they have 
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overheard and also (ii) code so that all one-hop downstream 
nodes can decode. This must be done at every hop across the 
path of a flow and cross-layer optimization approaches can be 
used J6] to further boost the performance. 

One important problem that remains open, and is the focus 
of this paper, is COPE's performance in the presence of non- 
negligible loss rates. The reason is that intermediate nodes in 
COPE require the knowledge of what their neighbors have 
overheard, in order to perform one-hop inter-session NC. 
However, in the presence of medium-high loss rate, although 
each node fully cooperates to report what it has overheard, 
this information is limited, possibly corrupted, and/or delayed 
over lossy wireless channels. COPE turns off NC if loss rate 
exceeds a threshold with default value 20% (2). However, 
this does not take full advantage of all the available NC 
opportunities. To better illustrate this key point, let us discuss 
the following example. 

Example 1: Let us consider Fig. Q] and focus on the 
neighborhood of node /, i.e., only the packets transmitted 
via /, from A\ to A2 and from B\ to Bi- This forms an 
"X" topology which is a well-known, canonical example of 
one-hop opportunistic NC H), Q- In me absence of loss, 
throughput is improved by 33.3%, because I delivers two 
packets in three transmissions (with NC), instead of four 
(without NC). Let us re-visit this example when there is packet 
loss. Assume that there is loss only on the overhearing link 
At — B2, with probability P{A t .B 2 } — 0-3, an d a ll other links 
have no loss. In this case, 70% of the packets can still be coded 
together, and throughput can be improved by 26%, which is 
still a significant improvement. Even at higher loss rate, e.g., 
P{a 1: b 2 } = 0-5, inter-session coding still improves throughput 
up to 20% This is under the assumption that / knows the 
exact state of A-2.,Bi, i.e., what packets were overheard, and 
thus / is able to decide what packets to code together so as 
to guarantee decodability at the receivers. However, at high 
loss rates, cooperation among nodes becomes difficult. This is 
why COPE turns off the coding functionality when loss rate 
is higher than a threshold with default value 20%, thus not 
taking full advantage of all coding opportunities. ■ 

We propose a solution to this problem with a design which 
combines intra- and inter-session NC over wireless mesh 
networks. We use intra-session NC to combine packets within 
the same flow and introduce parity packets to protect against 
loss. Then, we use inter-session NC to combine packets from 
different (already intra-session coded) flows, and thus increase 
throughput. Our approach for combining intra-session with 
inter-session NC, which we refer to as I 2 NC, has two key 
benefits. First, it can correct packet loss and still perform inter- 




Fig. 1. Example of a unicast flow (from Si to Ri) traversing multiple 
wireless hops. Each node performs (intra- and inter-session) NC. The neigh- 
borhood of / is shown here in detail. (Two unicast flows, Si — R± and 
S2 — R2, meeting at intermediate node /. / receives packets a, b from nodes 
Ai, Bi, respectively. It can choose to broadcast a, b or a + b in a single 
transmission to both receivers. The next hops A2, B2 can decode a+b because 
they overhear packets b and a transmitted from Bi,Ai, respectively.) 

session NC, even in the presence of medium-high loss rates, 
thus improving throughput. Second, the use of intra-session 
NC makes all packets in the session equally beneficial. Thus, 
I 2 NC eliminates the need to know the exact packets that have 
been overheard by the neighbors of intermediate node /. It 
is sufficient to know the loss probabilities of overheard and 
transmitted packets. In our scheme, this information is reported 
by each node to the nodes in its neighborhood which makes 
NC possible even at higher loss rates. 

Adding redundancy in this setting is non-trivial, since a flow 
is affected not only by loss on its direct links, but also by loss 
on overhearing links. This affects the decodability of coded 
packets. Therefore, the amount of redundancy needed to be 
determined carefully. 

Example 1 - continued: Consider again the neighborhood 
of / in Fig. 03 Flow 2 (originated from is affected not 
only by loss on its own path B\ — I — B2, but also by loss on 
the overhearing link A\ — B2, which affects the decodability 
of coded packet a + b at B2 . In order to protect flow 2 from 
high loss rate on the overhearing link A± — B2, I may decide 
either to add redundancy on flow 2, or to not perform coding, 
or a combination of the two. On the other hand, I may also 
decide to add redundancy on flow 1 (originated from Si), to 
correct loss on the overhearing link A\ — B2, thus helping Bj 
to receive a and decode a + b. ■ 

Therefore, a number of questions need to be addressed in 
the design of a system that combines both intra- and inter- 
session NC. In particular: 

Ql: How to gracefully combine intra- and inter-session NC? 
We propose a generation-based design, and specify the 
order we perform the two types of coding. 

Q2: How much redundancy to add in each flow? We show 
how to adjust the amount of redundancy after taking into 
account the loss on the direct and overhearing links. We 
implement the intra-session NC functionality as a thin 
layer between IP and transport layer. 

Q3: What percentage of flows should be coded together and 
what parts should remain uncoded? We design algo- 
rithms that make this decision taking into account the 



loss characteristics on the direct and overhearing links. 
We implement this and other functionality {e.g., queue 
management) performed with or after inter-session NC 
as a layer between MAC and IP. 
Q4: What information should be reported to make these 
decisions? We propose two schemes: I 2 NC-state, which 
needs to know the state (i.e., overheard packets) of the 
neighbors; and I 2 NC-stateless, which only needs to know 
the loss rate of links in the neighborhood. 
Our approach is grounded on a network utility maximization 
(NUM) framework Q. We formulate two variants of the 
problem, depending on available information (as in Q4 above). 
The solution of each problem decomposes into several parts 
with an intuitive interpretation, such as rate control, NC rate, 
redundancy rate, queue management, and scheduling. The 
structure of the optimal solution provides insight into the 
design of our two schemes, I 2 NC-state and I 2 NC-stateless. 

We evaluate our schemes in a multi-hop setting, and we 
consider their interaction with the transport layer, including 
TCP and UDP We propose a thin adaptation layer at the 
interface between TCP and the underlying coding, to best 
match the interaction of the two. We perform simulations in 
GloMoSim (8), and we show that our schemes significantly 
improve throughput compared to COPE. 

The structure of the rest of the paper is as follows. SectionHTl 
presents related work. Section [III] gives an overview of the 
system model. Section HVI presents the NUM formulation and 
solution. Section [V] presents the design of the I 2 NC schemes 
in detail. Section [Vl] presents simulation results. Section IVIII 
concludes the paper. 

II. Related Work 

COPE and follow-up work. This paper builds on COPE, 
a practical scheme for one-hop NC across unicast sessions 
in wireless mesh networks (2), which has generated a lot 
of research interest. Some researchers tried to model and 
analyze COPE 0, |10|, QTJ- Some others proposed new 
coded wireless systems, based on the idea of COPE lfl2l . 0. 
In 0~3], the performance of COPE is improved by looking 
at its interaction with MAC fairness. Our recent work in |6j 
improves TCP's performance over COPE with a NC-aware 
queue management scheme. This paper also improves COPE 
by adding intra-session redundancy with a cross-layer design 
and reducing the amount of information that is needed to be 
exchanged among nodes cooperatively, i.e., nodes no longer 
need to know the exact packets overheard by their neighbors 
and can operate only with knowledge of the link loss rates. 

NUM in coded systems. The NUM framework can be 
applied in networks, to understand how different layers and/or 
modules (such as flow control, congestion control, routing, 
etc.) should be restructured when NC is used. Although 
the approach is general, the parts and interpretation of the 
distributed solution is highly problem-specific. For NUM to 
be successful, the optimization model must be formulated so 
as to capture and exploit the NC properties. This is highly 
non-trivial and problem-specific. A body of work has looked 
at the joint optimization of NC of unicast flows, formulated 
in a NUM framework. 



Optimal scheduling and routing for COPE are considered 
in (9l and ifTTIl . respectively. A linear optimization framework 
for packing butterflies is proposed in [4]. A re-transmission 
scheme for one-hop NC is proposed in lfl4l . Forward error 
correction over wireless for pairwise NC is proposed in lfT31 . 
Ifl6l . which are also the most closely related formulations to 
ours. Our main differences are that we consider: (i) multiple 
flows coded together instead of pairwise, (ii) local instead 
of end-to-end redundancy, and (iii) the effect of losses over 
direct and overhearing links, to generate the right amount of 
redundancy. 

Dealing with wireless loss. Recent studies of IEEE 802.11b 
based wireless mesh networks ifTTl . ff8l . have reported packet 
loss rates as high as 50%. Dealing such level of loss in wireless 
networks is a hard enough problem on its own, which is further 
amplified by NC. There is a wide spectrum of well-studied 
options for dealing with loss, e.g., using redundancy and/or 
re-transmissions, locally (MAC) and/or end-to-end (transport 
layer). Local re-transmissions increase end-to-end delay and 
jitter, which, if excessive, may cause TCP timeouts or hurt 
real-time multimedia. Furthermore, the best re-transmission 
scheme for network coded packets varies with the loss proba- 
bility and it is hard to switch among re-transmission policies 
when the loss rate varies over time. Re-transmission also 
requires state synchronization to perform inter-session NC, 
which is not reliable at all loss rates. We follow an alternative 
approach of local redundancy because we are interested in 
keeping delay low and we want to eliminate the need for 
knowing the state of neighbors. 

There is extended work on TCP over wireless. One key 
problem is the need to distinguish between wireless and 
congestion loss and have TCP react only to congestion; this is 
possible e.g., through Explicit Congestion Notification (ECN). 
When re-transmissions exceed the delay budget, end-to-end 
redundancy may also be used to combat loss on the path 
|fl9l . The error-correcting capabilities of intra-session NC 
have recently been used in conjunction with the TCP sliding 
window in |20l . In contrast, we focus on one-hop inter-session 
coding rather than end-to-end intra-session coding. 

III. System Overview 

We consider multi-hop wireless networks, where intermedi- 
ate nodes perform intra- and inter-session NC (I 2 NC). Next, 
we provide an overview of the system and highlight some of 
its key characteristics. 

A. Notation and Setup 

1) Sources and Flows: Let S be the set of unicast flows 
between source-destination pairs in the network. Each flow 
s £ S is associated with a rate x s and a utility function U s (x s ), 
which we assume to be a strictly concave function of x s . 

1 We have observed through simulations that if a network coded packet is lost for one 
receiver but received correctly for other receiver(s), it is better to re-transmit the same 
network coded packet for low loss rates. However, it is better to combine the packet 
which is lost in the previous transmission with new packets for high loss rates. 



2) Wireless Transmission: Packets from a source (e.g., Si 
in Fig. [TJ traverse potentially multiple wireless hops before 
being received by the receiver (e.g., Ri). We consider a 
model for interference described in 11221 : each node can either 
transmit or receive at the same time, and all transmissions in 
the range of the receiver are considered as interfering. 

We use the following terminology for wireless. A hyperarc 
(i, 3) is a collection of links from node i £ Af to a non-empty 
set of next-hop nodes J C Af. A hypergraph H = (Af, A) 
represents a wireless mesh network, where Af is the set of 
nodes and A is the set of hyperarcs. For simplicity, h = (i : J) 
denotes a hyperarc, h(i) denotes node i and h(J) denotes the 
set of nodes in J, i.e., h(i) = i and h(J) = J. We use 
these notations interchangeably in the rest of the paper. Each 
hyperarc h is associated with a channel capacity Rh- Since h 
is a set of links, Rh is the minimum capacity of all the links 
in the hyperarc, i.e., Rh = mhij-g^j)!^ s.t. i £ Af. In the 
example of node / in Fig. Q] h = (I, {B2, A2}) is one of the 
hyperarcs, and its capacity is min{i?{ 7 B2 j, ,a 2 }}- 

Note that with both intra- and inter-session NC, it is possible 
to construct more than one code over a hyperarc h. Let ICh 
be the set of inter-session network codes over a hyperarc h. 
Sk C S be the set of flows coded together using code k £ JCh 
and broadcast over 

Given TL, we can construct the conflict graph C = (A, I), 
whose vertices are the hyperarcs of "H and edges indicate 
interference between hyperarcs. A clique C q C A consists of 
several hyperarcs, at most one of which can transmit without 
interference, i.e., a transmission over a hyperarc interferes with 
transmissions over other hyperarcs in the same clique. 

3) Loss Model: A flow s may experience loss in two forms: 
loss p s h over the direct transmission links; or loss p h ,s k of 
antidote^] on overhearing links. These two types of loss have 
different impact on network coded flows. 

First, let us discuss loss on the direct links. A flow s trans- 
mitted over hyperarc h experiences loss with probability p h . 
This probability is different per flow s, even if several flows 
are coded and transmitted over the same hyperarc h, because 
different flows are transmitted to different next hops, thus see 
different channels. For example, in Fig.Q] p^ ^ A is equal 

to the loss probability over link / — A 2 and pfj{g 2 ^ 2 }) is 
equal to the loss probability over link I — B^. 

Second, let us discuss the effect of lost antidotes on the 
overhearing link. Consider that flow s is combined with flow 
s' s.t. s ^ s', and that some packets of flow s' are lost on 
the overhearing link to the next hop of s. Then, coded packets 
cannot be decoded at the next hop and flow s loses packets, 
with probability pf'l . For example, in Fig. Q] packets from 
flow 5i cannot be decoded (hence are lost) at node A2 due to 
loss of antidotes from flow S2 on the overhearing link B\ — A2. 

In our formulation and analysis, we assume that p s h and pf'i 

2 Note that we consider constructive inter-session NC, i.e., network codes 
k £ as well as h = (i, J) is determined at each node with periodic 

control packet exchanges or estimated through routing table. 

3 Following the poison-antidote terminology of (4), we call "antidotes" the 
packets of flows s' that are coded together with s, and thus are needed for 
the next hop of s to be able to decode. E.g., in FigfT] a is the "antidote" that 
B2 needs to overhear over link A\ — B2, to decode a + b and obtain b. 
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are i.i.d. according to a uniform distribution. However, in our 
simulations, we consider a Rayleigh fading channel model. 
The loss probabilities are calculated at each intermediate node 
as explained later in this section. 

4) Routing: Each flow s G S follows a single path V s C M 
from the source to the destination, which is pre-determined by 
a routing protocol, e.g., OLSR or AODV, and given as input to 
our problem. Note that the nature of wireless networks is time 
varying, i.e., nodes join and leave the system dynamically. In 
such cases, the routing protocol actively determines new paths 
which are used as input to our problem. It is not critical that 
the paths remain fixed, neither from a theoretical nor from a 
practical point of view, as explained in the following sections. 
Also, note that several different hyperarcs may connect two 
consecutive nodes along the path. We define Hi fe = 1 if s is 

transmitted through hyperarc h using network code k e Kh\ Fig. 2. Operations taking place at end-points and intermediate nodes. 

and Hi k = 0, otherwise. 
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B. Intra- and Inter-session Network Coding 

Next, we give an overview of how an intermediate node 
performs intra- and inter-session NC. The implementation 
details are provided in Section [V] 

1) Intra-session Network Coding (for Error Correction): 
Consider the commonly used generation-based NC ||231 : pack- 
ets from flow s G S are divided into generations (note that 
we use "generation" and "block" terms interchangeably), with 
size G s . At the source s, packets within the same generation 
are linearly combined (assuming large enough field size) to 
generate G s network coded packets. Each intermediate node 
along the path of flow s adds P s parity packets, depending 
on the loss rates of the links involved in this hop. At the next 
hop, it is sufficient to receive G s out of G s + P s packets. 
The same process is repeated at every intermediate node until 
the receiver receives G s error-free packets, which can then be 
decoded and be passed on to the application. 

There are many ways to generate parities (P s ) in prac- 
tice. We use generation based intra-session NC ll23l for this 
purpose. Although one could use various coding techniques, 
such as Reed-Solomon or Fountain codes, using intra-session 
NC has several advantages. First, it has lower computational 
complexity. Second, in systems like COPE that already imple- 
ment inter-session NC, it is natural to incrementally add intra- 
session NC functionality. Moreover, in this setting, hop-by-hop 
intra-session coding (in which redundant packets are generated 
at each hop) is clearly a better choice than end-to-end coding 
for dealing with loss. In terms of performance, hop-by-hop 
coding achieves higher end-to-end throughput (thanks to in- 
troducing less redundancy than end-to-end coding), without 
adding high complexity (and thus delay) to the intermediate 
nodes. Furthermore, in terms of system implementation, our 
hop-by-hop scheme requires minimal modifications on top of 
the inter-session NC, which is already implemented. 

2) Inter-session Network Coding (for Throughput): After 
an intermediate node has added redundancy (P 3 ) to flow s, it 
treats all (G s + P s ) packets as indistinguishable parts of the 
same flow. Inter-session NC is applied on top of the already 
intra-coded flows, as a thin layer between MAC and IP (similar 



to COPE), shown in Fig. [2] We design two schemes, I 2 NC- 
state and I 2 NC-stateless, depending on the type of information 
that is needed to make network coding decisions. We define 
as state of a node the information about which exact packets 
have been overheard at that node. 

I 2 NC-state: First, we assume that intermediate nodes use 
COPE for inter-session coding. Each node i listens all 
transmissions in its neighborhood, stores the overheard packets 
in its decoding buffer, and periodically advertises the content 
of this buffer to its neighbors. When a node i wants to transmit 
a packet, it checks or estimates the contents of the decoding 
buffer of its neighbors. If there is a coding opportunity, 
the node combines the relevant packets using simple coding 
operations (XOR) and broadcasts the combination to J . The 
content of the decoding buffers needs to be exchanged, in order 
to make NC decisions, i.e., state synchronization is required. 

I 2 NC-stateless: Second, we design an improved version of 
COPE, which no longer requires state synchronization. The 
key idea is to exploit the fact that the redundancy already 
introduced by intra-session codingmakes all G s + P s packets 
in a generation equally importantQ In this improved scheme, 
each node i still listens to all transmissions in its neighborhood 
and stores the overheard packets^ The node periodically 
advertises the loss rate for each received and overheard flow, 
which is then provided as input to the intra-session NC module 
to determine the amount of redundancy needed. In particular, 
the loss rates are calculated at each intermediate node as 
one minus the ratio of correctly received packets over all the 
packets in a generation. Also, the loss rate over overhearing 
links is calculated as effective loss rate. E.g., in Fig.Q] the loss 
rate at node is calculated as follows. If G Sl +P Sl packets 
are sent by B\ and at least G Sl packets are received at A%, 
then the loss rate is set to 0. If G Sl — u packets are received 
by Ai such that u < G Sl , then the loss rate is set to u/G Sl . 
The loss rates calculated for each generation are advertised to 
other nodes in the neighborhood. Then, each node calculates 

4 It no longer matters which exact packets a node has. As long as a node 
has any G s out of G s + P s , it can decode with high probability. As long as 
it knows the percentage of received packets it can make coding decisions. 

5 Note that when inter-session network coded packets are overheard, they 
are not stored in the "decoding buffer", but discarded. 



its loss probabilities (pi and p^ s k ) as weighted average of the 
loss rates it has received. 

In summary, there is a synergy between intra- and inter- 
session NC. Intra-session makes the process sequence agnos- 
tic, which allows inter-session coding to operate using only 
information about the loss rates, not about the identity of the 
packets. The loss rates can be used as input for tuning the 
amount of redundancy in intra-session NC. In terms of im- 
plementation, the two modules are separable: an intermediate 
node first performs intra-session, then inter-session NC. 

IV. Network Utility Maximization Formulation 

A. I NC-state Scheme 

1) Formulation: Our objective is to maximize the total 
utility function by optimally choosing the flow rates x s at 
sources s 6 S, as well as the following variables at the 
intermediate nodes: the fraction a s h k (or "traffic splitting" 
parameters, following the terminology of 112410 of flows inter- 
session coded using code fc £ fCh over hyperarc h; and the 
percentage of time T%,k each hyperarc is used. 



max y U s (x s ) 



S.t. 



E H h,k a h,k X s'Ph% ^ R hTh,k, 



H h,k a h,k X s 

s>es k -{s} 
Vh e A, fc g fC h ,s e S k 

E E «fc,k = 1 > VseS,ieV s 

h{J)\h£Ak£K h \seS k 
E E Th > k ^ 7> VC " ^ A 

hec q k&K h 



(1) 



The first constraint is the capacity constraint for each flow s £ 
Sk- It is well-known, 11251 . that NC allows flows that are coded 
together in code k £ ICh, to coexist, i.e., each have rate up to 
the rate allocated to that code k. The right hand side, RhTh,k< 
is the capacity of hyperarc h; Th,k is the percentage of time 
hyperarc h can be used for transmitting the fc-th network code. 
Th,k is determined by scheduling in the third constraint, taking 
into account interference: all hyperarcs in a clique interfere and 
should time-share the medium. Therefore, the sum of the time 
allocated to all hyperarcs in a clique should be less than an 
over-provisioning factor, 7 < 1. The second constraint is the 
flow conservation: at every node i on the path V s of source s, 
the sum of a s h k over all network codes and hyperarcs should 
be equal to 1. Indeed, when a flow enters a particular node 
i, it can be transmitted to its next hop j as part of different 
network coded and uncoded flows. 

The first constraint is key to our work because it determines 
how to deal with loss on the direct (p s h ) and overhearing (p s h ,s k ) 
links and how large a fraction (a s h k ) of flow rate (x s ) to code 
in the fc-th code over hyperarc h. Let us discuss the left hand 
side in more detail 

6 Note that our formulation has two novel aspects, compared to prior work, 
which allow us to better handle loss and parities. First, we allow for flows 
coded together to have different rates (in the first constraint in Eq. {T}). 
Second, we allow for loss rates of each link to be specified separately, even 
for links in the same hyperarc. 



The first term refers to the direct link of flow s. k a s h k x s 
is the fraction of flow rate x s allocated to code k and hyperarc 
h. It is scaled by 1 — p s h to indicate that we use redundancy 
to protect against loss that flow s experiences with probability 
p s h . (H^ k a s h k x s)/0-—Ph) is the total rate of flow s, including 
data and redundancy. 

The second term refers to loss on the overhearing links. 

E s ' eSfe -{s} H ik a h,k x s' P^k is the amount of redundancy 
(via intra-session coding) added by the intermediate node on 
flow(s) s' to protect flow s against loss of antidote packets. 
These antidotes come from other flows (s' £ ICh) that are 
coded together with flow s, reach the next hop for flow s 
through the overhearing links, and are needed to decode inter- 
session coded packets. 

Example 1- continued. In Fig. Q] let us consider flow 2 from 
Bi to £?2, as the flow of interest. The intermediate node / adds 
redundancy to S% to protect against loss rate pfj{B 2 A 2 }) on 

the direct link / — £>2- It also adds redundancy to flow 1 to 

s s 

protect against loss rate p^^g 2 of antidotes coming to 
B 2 from flow 1 over the overhearing link A\ — B 2 . 

2) Optimal Solution: To solve Eq. (Q]i we follow a sim- 
ilar approach proposed in [36). First, we relax the capacity 
constraint in Eq. (Q3, and we have the Lagrangian function: 

L(x,a,T,q) =E^s«)-E E J2llk( 

s£S hGAkeK h seS k 



l h,k a h.k x s 



Pi 



s>£S k -{s} 



H h,ka S h ,k X s'Phl ~ R hTh,, 



(2) 



where k is the Lagrange multiplier, which can be in- 
terpreted as the queue size for fc-th network code at hy- 
perarc h for flow s. We define pf'l = if s = 

s',Vs,s' £ S and we rewrite }ZkeK h E seSfc as E se< s 
J2keK h \ses k - The Lagrange function is L(x,a,T,q) = 

T,s & s( u '( x *)- X ° TrheA2~2keic h \s&s k H h,k a h,M q h,k) 

Ph) + J2 s 'es k (lh,kP S h,k)) + T,heAT,k(iK: h £ se5fc 1h,k R hTh,k- 
It can be decomposed into several intuitive parts (rate control, 

traffic splitting, scheduling, and queue update), each of which 

solves the optimization problem for one variable. 

Rate Control. First, we solve the Lagrangian w.r.t x s : 



(u'.r 1 (E, ePs Qt) 



(3) 



where (L^) -1 is the inverse function of the derivative of U s , 
and Ql is the occupancy of flow s at node i and expressed as 

Qi = J2h(J)\heA J2kelC h \seS k ^h,k a h,kQh,k^ W 

where Qi k is the queue size of flow s associated with 
hyperarc and network code pair {h, k}: 



Qh,k — l-ps +Es'£Si-{s} 1h,kPh,k 



(5) 



Traffic Splitting. Second, we solve the Lagrangian for a s h k . 
At each node i along the path (i.e., i £ Vs), the traffic splitting 
problem can be expressed as follows: 



(J)\heA J2keK h \seS k a fi,k^h,kQh,k 



s -t- J2h(J)\h&A2ZkEK h \seS, 



l h.k 



(6) 



Let us assume that Ei[Q(t)} is the maximal Q^t) at time 
t such that Q t (t) = j^E^w^iQU*) with 
A{t) := = {/»(J)>}K, fc > OorH° htk Ql k < 

Qi(t),h(J) € Af si. h e A, k e JC h }. At each node 
i, the amount of traffic splitting factor a s h k for flow s 
over hyperarc h and code fc follows; a s h k = m[Ei[Q] — 
H'h kQh kla s ' where is a positive constant, and [6] + = 
& if z > ' and [6]+ = if 6 < and z = 
0. It can be seen that Eh(j)\heA Y,keK h \seS k <*M = 
and E h (J)\h&A^k£K h JseS k ®t,k H h,kQh,k ^ 0. Also, 
Y.h(j)\heAY.keK h \ses k a kk H lkQ S h.k = onl y if «h,k = 
which is possible only if Hj[ k Q s h k > Q { , and 

<AH s h , k Ql. k -Q t ) = o. 

The structure of the optimal solution of Eq. © (i.e., a hk — 
Ki[Ei[Q] — k Q s h fc ]^ s ) has the following interpretation: 

the higher the loss rate of antidotes on overhearing links p s h '?, 
the higher Q s h k , and the smaller a s h k . This means that flow 
s should code fewer packets with packets from fiow(s) s' in 
code k, when antidotes from s' are likely to be lost. 

Example 1 - continued: In Fig. [TJ this means that / should 
combine fewer packets from the two flows if there is loss on 
the overhearing link A\ — B-i- In the extreme case where loss 
rate is 1 over the link A\ — £>2, inter-session coding should be 
turned off. At the other extreme, where there is no loss, the 
two flows should always be combined. ■ 

Scheduling. Third, we solve the Lagrangian for 77,^. This 
problem is solved for every hyperarc and every clique for the 
conflict graphs in the hypergraph. 

m axE/ ie .4 T,keic h T,ses k Qh,k R hTh,k 
^Ehee q ^keK h ^,k<r,yC q CA (7) 

Let us assume that Q hyk = RhJ2 s es k 1h,k> and E c q [Q(t)} 
is the minimal Q Cq (t) at time t such that; Qc q (t) = 
\ A >^ (t) \ ^4>eA' Cq Qh,k(t) with A 'c q : = W = {MIKfc > 

or Qh.k(t) > Q Cq (t),h 6 A,k £ JC h }. At each clique C q , 
the fraction of the time that is allocated to hyperarc h, 
and code k is as follows; r h . k = e Cq [Qh,k - E Cq [Q]}t hk , 
where ec is a positive constant and [&]+ = b if z > 
and [6]+ = if b < and z = 0. It can be seen that 
Efcee, Eke*:,, ^ h,k = and J2hec q T, k£ K h T KkQh,k > 0. 
Also ' EheC, SfeeKfe T h.kQh,k = only if T h . k = which 

requires that Q hik = Q Cq or Th,k = and Q, hk < Q Cq . 

Queue Update. We find the Lagrange multipliers (queue 
sizes) k , using the gradient descent: 

< fc (^+i) = M fc w+Q{ H "r-1f 3 + 



J2s'es k ~{s} H h,k a ik x s'PhX ~ RhT h ,k}Y 



(8) 



where t is the iteration number, q is a small constant, and 
the + operator makes the Lagrange multipliers positive, k 
is interpreted as the queue for flow s allocated for the k- 



th network code over hyperarc V/i € A. Indeed, in Eq. (©, 
q^ k is updated with the difference between the incoming 

{ H h,k a h,k x s)/( l " Ph) + E s >es h -{s} H (k a ik x s'Phi and 
outgoing RhTh.k traffic rates at h\j 

B. I 2 NC-stateless Scheme 

The second term in Eq. (Q]) describes the redundancy added 
by node i to protect flow s from loss of antidotes on the 
overhearing link. An implicit assumption was that node i 
knows what antidotes are available at the next hop and uses 
only those packets for inter-session coding. However, this 
knowledge can be imperfect, especially in the presence of loss. 
Here, we formulate a variation of the problem, where such 
knowledge is not necessary. Instead, node i needs to know 
only the loss rate on all the links to the next hop for flow s 
(e.g., in Fig. Q] for flow 2 (S2), these are links I — B2 and 
At - B 2 ). 

We replace the capacity constraint in Eq. ([TJ with: 



65 fc -{s} 



H h,k a h, k X s'P S h 



-Ph 



< RhT h , k (9) 



and this is V/i G A, k 6 ICh,s € Sk- The other constraints 
remain the same as in Eq. (Tj~|). The difference from Eq. ([TJ is 
in the second term, related to the overheard packets at the next 
hop. Any fraction of flow ,s' added as redundancy to flow s, 
as well as overheard packets from s' in the next hop, help to 
decode inter-session coded packets of s with flow s'. To protect 
transmissions of these "helping" fractions (H^ k a s h k x s ip s ^ s k ) 
from being lost on the direct link to the next hop of flow s 
(e.g., from / to B2), we add redundancy to match the loss 
rate of that direct link (p| in general, B , in the example). 

This is why the term k a s h k x S 'p s ^ s k is divided by 1 — p s h . 

The solution of this optimization problem also decomposes 
into rate control, traffic splitting, and scheduling problems, 
which correspond to Eq. (O, ©, and (0, respectively. Q s h k 
needs to be updated: 



)S _ Ih.k 

<h,k - 1-pl 



ih,kPh'k 



Js'ES k -{s} i_ p =' • 

The Lagrange multiplier is updated as follows; 

€At+i)={<k(t)+c t { Hi cir + 



(10) 



H 



s'eS fc -{s} 



h, k a h. k x 3' Ph. k 



R h T h , k }Y 



(ID 



We provide the convergence analysis of our solution in Ap- 
pendix A@ We first give the proof of convergence, then we 
verify the convergence through numerical calculations. 

7 Note that the queue update in Eq. (8) can be re-written as; cf h k = 



s k -{s) H h,k a h,k x s'Ph" k ~ R h- T h,k]gS k , where 



"/hi !_ p s TZ^s'eS fc 

-fh is a positive constant. 

8 We do not claim that the solution of our network utility maximization 
problem is the optimal solution to the general intra- and inter-se ssio n NC 
problem over wireless networks. This is well-known, open problem [26], [27|, 
1281 . Even without an optimal, closed form solution, there is still value in 
using the structure of the solution to design mechanisms that perform well 
in practice, as we show through the numerical and simulations results in the 
next sections. 



V. System Implementation 

We propose practical implementations of the I 2 NC-state 
and I 2 NC-stateless schemes (Fig. 01, following the NUM 
formulation structure. 

A. Operation of End-Nodes 

At the end nodes, there is an adaptation layer between 
transport and intra-session NC which has two tasks: (i) to 
interface between application and intra-session NC; and (ii) to 
optimize the reliability mechanism at the transport layer. 

Task (i): At the source, the adaptation layer sets the gen- 
eration (block) size G s . G s is set according to application; 
e.g., media transmission requirements for UDP, or set equal 
to TCP congestion window for TCP applications and changes 
over time. The adaptation layer receives G s original packets 
Pi,P2, ■■■,Pg b from the transport layer of flow s and generates 
G s intra-session coded packets; a\ = p±, a2 = pi + P2, 
«G s = Pi + --.+PG 3 - We call this coding "incremental additive 
coding". We chose the incremental additive coding to avoid 
introducing coding delays (i.e., our algorithm does not need to 
wait G s packets to encode packets) as proposed in 0201 . The 
intra-session header includes the block id, packet id, block 
size, and coding coefficients. At the receiver side, the reverse 
operations are performed. 

Task (ii): To further optimize the interaction between I 2 NC 
and transport, particularly TCP, we keep track of and acknowl- 
edge the number of received packets in a generation, rather 
than their sequence numbers (note that this part is not needed 
for UDP protocol). This idea is similar to the use of end-to-end 
FEC and intra-session NC that make TCP sequence agnostic 
|fl9l , |29l , l20l . E.g., if a receiver receives the first packet 
labeled with block id g s = 1, then it generates an ACK with 
block id g s = 1 and packet id if = 1. The uncoded packets, 
Pi,P2, ...,pg s , are stored in a buffer at the source for TCP 
ACK adaptation. E.g., if an ACK for block id g s = 1 and 
packet id 77 s = 1 is received by the source, then the TCP 
adapter matches this ACK to packet p\ and informs TCP that 
packet pi is ACK-ed. As long as the TCP receiver transmits 
ACKs, the TCP clock moves, thus improving TCP goodput. 
After the ACK with the block and packet ids is transmitted 
by the TCP receiver, the packet is stored at the receiving 
buffer. When the last packet from a generation is received, 
then packets are decoded and passed to the application. 

B. Operation of Intermediate Nodes 

An intermediate node needs to take a number of actions 
when it receives (Alg. [TJ or transmits (Alg. |2} a packet. 

1 ) Receiving a packet and intra-session network coding: 
Buffer packets. A node i may receive a packet from higher 
layers or from previous hops. In the latter case, if the re- 
ceived packet is inter-coded, it is decoded and the packet 
with destination to this node is stored (or is passed to 
transport if it is the last hop). If it is not the last hop, a 
packet ai e {a±, az, ac} is stored in the output queue 
Qi. In addition to the physical output queue Qi, the node 
i keeps track of several virtual queues; Q s h k per (flow, 



Algorithm 1 Node i processes packet a/ from flow s. 

1: Read the information: packet ai, from flow s (generation size G s ) 

2: Insert ai into the physical output queue Qi. 

3: Determine {h* , /c*} and label a/ with {h* , k*J pair and s 

4: Update q s h * fc * (using Eqs. (8} and Q It ) and q^* fc * 

5: Calculate Q" h , fc » (using Eqs. (5} and {TO}) and Q\ (using Eq. (4j) 

6 : <3fi* ,fc» = G s h * ,k* + 1 

7: if G s packets from flow s are received at node i then 
8: Calculate the number of parities P^'^., P^ f 
9: Create P^ 'f parities from s and P^ f parities from s' 
10: Label all generated parities with {h, k} pair and s 



hyperarc, code). The packet ai is labeled with (h*,k*,s), 
which essentially indicates whether and how to code this 
packet according to the traffic splitting in Eq. ©: we pick 
{h*,k*} — argmin^fcliJ^ k Q s h k }, randomly breaking ties. 
Note that this labeling is local at the node, and does not 
introduce any transmission overhead. 

Note that Hf t k is the indicator whether flow s is transmitted 
over hyperarc h with code k. This indicator is determined at 
each node using a routing table which has a data structure to 
determine the next hops (note that paths do not need to be 
known by the sources or any node in the system). Basically, if 
a packet from flow s is able to reach to the next hop determined 
by the routing table when it transmitted over hyperarc h and 
with code k, then the indicator is set to 1, otherwise 0. We 
also note that in this system, as long as paths remain fixed 
for longer (at least longer than a time required to transmit 
a packet) time periods, we can see more benefit from NC, 
because each node will learn which flows can be network 
coded and estimate the loss rates better as time gets longer. 
However, even in the extreme case in which paths change 
very fast (say for example at every packet transmission), 
our system works well, but it does not fully exploit NC 
opportunities, since it cannot estimate whether NC is possible 
or not. However, it works not worse than a system without NC. 
Therefore, I 2 NC is designed to adapt to path changes and to 
exploit NC benefit if possible. 

Update Virtual Queue Sizes. When packet ai is selected 
to be transmitted with the fc*-th network code over hyperarc 
h*, the virtual queues; Q s h , k , and q^, k , should be updated. 
q^, k , is updated according to Eqs. © and (fTTT i. Q^, k , is 
calculated according to Eq. (0 for I 2 NC-state and Eq. (TTOb for 
I 2 NC-stateless. Qf is calculated according to Eq. Then, the 
number of packets k , from the same generation that are 
allocated to h* , k* pair is incremented: G^, k , = G\, fe » + 1. 
G| k is set to for each new generation. 

Generate Parities. After G s packets from a generation of 
flow s are received at node i, P s parity packets are generated 
via intra-session NC (which is performed according to random 
linear NC |30|) and labeled with information (s, h, k). There 
are two types of parities. 

• Phi = kPh/fi ~ P ar iti es af e added on flow s's 
virtual queue to correct for loss during direct transmission 
to the next hop over hyperarc h. 

• Phk = l^hkPh'kl' ^ s> e parities are added on 
the virtual queues of other flows s' that are inter-session 
coded together with s. This is to help the next hop for s' 
to decode despite losses on the overhearing link. 



Algorithm 2 Node i transmits a packet. 

1: Select {h\ k^} pair that maximizes Qh.k — %(Zl s e5 Qh k ) 

2: Initialize: £ = 

3: for ai 6 do 

4: if a; is labeled with {h*, fc^} AND flow id label of ai is different from 
Va ; , e 5 then 

5: if I 2 NC-state AND £ U oj is decodable OR I 2 NC-stateless then 

6: Insert packet to £ 

7: Network code (XOR) all packets in £ 

8: Broadcast the network coded packet over hyperarc 

9: Update g» f fct , Vs e S k 

10: Re-calculate Q h fc = R h(Y, s eS k Vh,k) and <2i (usin 8 E 1- 0) 



These parity packets are for I 2 NC-state. For I 2 NC-stateless 

p hi is the same - but p hi s = \ G h,kPh,k/( 1 - p s h)i Le - 

additional redundancy is used to protect parity packets from 
loss on the direct link. 

2) Transmitting a packet and inter-session network coding: 
We consider the 802. 1 1 MAC. When a node i accesses a chan- 
nel, {h\ k^} is chosen to maximize Qh,k = Rh(52 S £S 1h k) 
according to Eq. (0, randomly breaking ties. Although the 
pair {h',k'} determines the hyperarc, code and flows to be 
coded together in the next transmission, the specific packets 
from those flows still need to be selected and coded. We call 
these packets the set £, and select them using the procedure 
specified in Alg. [2@ 

To achieve this, we first initialize the set of network coded 
packets £ = 0. For each packet a; e Qj, check whether 
a; is labeled with {ht,k'}. If it is, then we check whether 
its flow id label already exists in one of the packets in £, 
i.e., another packet from the same flow has already been 
put in £. If not, there is one more check for I 2 NC-state for 
decodability at the next hops of all packets in the network 
code, based on reports or estimates of overheard packets in 
the next hops, similarly to 0. If the packet is decodable 
with some probability larger than a threshold (default value 
is 0.20) then, a; is inserted to £. In I 2 NC-stateless, the packet 
ai is inserted to £ without checking the decodability, which is 
ensured through the additional redundancy packets. This is the 
strength of I 2 NC-stateless: it eliminates the need to exchange 
detailed state, which is costly and unreliable at high loss rates. 
After all packets in Qj are checked, the labels (h, k, s) of the 
packets in £, inter-session NC header is added, and coded 
(XORed) and broadcast over h. 

After a coded packet is transmitted, the virtual queues are 
updated according to Eqs. (8), dTTY The queues Qhf. fct an d 
Qf are calculated according to Eqs. ©, dTOl i, <f4T> J 1 °l 

We note that in both I 2 NC-state and stateless, packets are 
network coded if some conditions are satisfied. However, if 
these conditions are not met, a packet without NC is still trans- 
mitted, because at least one packet is inserted in £ (Alg. [2]). 
Thus, we do not delay any packets in our schemes. Yet, 
delaying packets may create more NC opportunities and there 
is a tradeoff between delay and throughput. These issues have 

'The inter-session NC header includes the number of coded packets 
together, next hop address, and the packet id's. Note that this header as well 
as the IP header of each packet are not network coded. 

10 Note that I 2 NC may cause re-ordering at the receiver, but since we already 
implemented intra-session NC, and made TCP receiver sequence agnostic in 
this term, out of packet delivery is not a problem for TCP. 



been considered in some previous work BP . 11321 . However, 
this is an aspect orthogonal to the focus of I 2 NC (which is 
the synergy between inter- and intra-session NC) and can 
potentially be combined with it. 

3) Keeping Track of and Exchanging State Information: 
For I 2 NC-state, intermediate nodes need also to keep track of 
and exchange information with each other, so as to enable the 
intra- and inter-session NC modules to make their redundancy 
and coding decisions and to provide reliability. An approach 
similar to COPE is used: ACKs are sent after the reception and 
successful decoding of a packet. Information about overheard 
packets is piggy-backed on the ACKs. With I 2 NC-stateless, 
we only need neighbors to exchange information about the 
loss rates at the neighboring nodes. Information about the loss 
rates as well as the number of received packets at a generation 
is reported through control packets for every generation^ In 
order to provide reliability, we consider re-transmissions. In 
I 2 NC-state, a packet is removed from the output queue only 
after an ACK related to the packets is received. Otherwise, 
the packet is re-transmitted after a round trip time. In I 2 NC- 
stateless, packets are removed from the output queue when 
a control packets is received and confirms the successful 
transmission of all packets of the corresponding generation. 
Otherwise, a number of intra-session coded packets from the 
generation which are missing at the receiver are generated 
from the packets kept in the queue and transmitted. 

4) Congestion Control and Queue Management: End-to- 
end congestion control {i.e., rate control) is given by Eq. (0 
in which if U s {x s ) = log(x s ), then x s = 1/ (J2iev s Qt)- 
This means that flow rate x s is inversely proportional with 
increasing queue size over the path of flow s. This behavior 
is similar to TCP's end-to-end congestion control algorithm, 
where congestion at a node may result in one or more packets 
may be dropped from the buffer at this node. TCP reacts to 
packet drops by reducing its rate. Thus, TCP reduces its flow 
rate when queue size increases. This gives us intuition that 
TCP mimics the rate control part of the decomposed solution. 
This intuition has been validated in Q, (33], (34), 11351 . 

Similarly, we consider that TCP already mimics the struc- 
ture of the rate control part in Eq. ||3). Therefore, upon 
congestion at node i, the per-flow queue sizes Qf are compared 
and the last packet from flow s having the largest Qf is 
dropped from the queue; in case of a tie, an incoming packet 
is dropped. We do not make any additional updates to TCP's 
end-to-end congestion control algorithm. Also, we do not 
implement any end-to-end congestion control mechanism for 
UDP Our goal is to keep UDP as it is (without any end-to-end 
control) and show the effectiveness of I 2 NC-state and I 2 NC- 
stateless when there is no end-to-end control. 

Example 2: Let us re-visit the X-topology from Fig. [TJ 
shown again for convenience in Fig. [3] and illustrate how 
we perform intra- and inter-session NC under scheme I 2 NC- 
stateless. The loss probabilities over the direct (/ — B2) and 

"in our implementation, the loss probabilities are calculated as weighted 
average of the loss rates. The weighted average is calculated over a window 
of 10 samples. The last 10 samples are ordered such that the newest sample 
is the first sample, and the oldest sample is the 10 th sample. Each sample is 
given a weight inversely proportional to its sample number. 
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Fig. 3. Example of coding (under scheme I 2 NC-stateless) at node I in the 
X-topology. There is loss only on two links: the direct link I — B2 (with 
probability 0.5) and the overhearing link Ai — B2 (with probability 0.25). 

overhearing (A\ — B 2 ) links are assumed 0.5 and 0.25. 

In Fig. [3ja), we describe intra-session NC. Let us assume 
the generation size of Si is G Sl = 4 and S 2 is G S2 = 1. 
The packets transmitted by A\, B\ are 01,02,03,04 and 61, 
respectively. Note that there is only one option for inter- 
session NC, i.e., to XOR packets from the two flows, thus 
there exists only one possible network code k = 1 over 
hyperarc h = (I, {B 2 , A2}). All packets are labeled with 
this information and their flow ids. The labeled packets are 
^, s i o S i 



,a 2 
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Si 



1, Si 



, of 1 and bf 2 . Parities are generated as follows. 

4 and Gj^ B ^ A ^ = 1, the number of 

parities is P^ M} = 0, P%£ A * } = 0, Pf\f ?>A2} = 1 
(thus generating one parity from flow S 2 and labeling it with 
S2, i.e., of 2 ), anc ^ \bI a 2 } = ^ ( tnus generating two parities 
from flow Si and labeling them with S2, i.e., aj? 2 ,aj? 2 ). 

In Fig.[3jb), we describe inter-session NC. Node / performs 
inter-session NC and transmits packets according to Alg. [2J it 
XORs packets from the two queues, for Si, 5*2, and broadcasts 
over the hyperarc (I, {B 2 , A 2 }). In particular, it transmits the 



following packets: of 1 ©frf 2 , a^ 1 © 



of 2 , 



©of 2 , and of 1 



Og 2 . A 2 receives and decodes all the packets. B 2 receives 
3 packets on the average over overhearing link A\ — B 2 and 
receives 2 packets over transmission link I — B 2 . Five received 
packets allows B 2 to decode all five packets a±, a 2 , 03, 04, b\, 
so 61 is successfully decoded. ■ 



VI. Performance Evaluation 

A. Simulation Setup 

We used the GloMoSim simulator [8], which is well suited 
for simulating wireless environments. We considered various 
topologies: X topology, shown in part of Fig. [TJand repeated in 
Fig. Ufa); the cross-topology with four end-nodes generating 



bi-directional traffic, with one relay shown in Fig. |Ub); the 
wheel topology shown in Fig.|4jc); and the multi-hop topology 
shown in Fig. [TJ In X, cross, and wheel topologies, the 
intermediate node / is placed in a center of of circle with 
radius 90m over 200m x 200m terrain and all other nodes 
A±, B\ and etc. are placed around the circle. In the multi- 
hop topology of Fig. [TJ two X topologies are cascaded and 
the distance between consecutive nodes is set to 90m. The 
topology is over a 800m x 300m terrain. 

We also considered various traffic scenarios: FTP/TCP and 
CBR/UDP. TCP and CBR flows start at random times within 
the first 5sec and are on until the end of the simulation which 
is 60sec. The CBR flow generates data packets at every 0.1ms. 
IEEE 802.11b is used in the MAC layer, with the addition of 
the pseudo-broadcasting mechanism, as in COPE Q. In terms 
of wireless channel, we simulated the two-ray path loss model 
and a Rayleigh fading channel with average channel loss rates 
0,20,30,40,50 %. We have repeated each 60sec simulation 
for 10 seeds. Channel capacity is 1Mbps, the buffer size at 
each node is set to 100 packets, packet sizes are set to 500B, 
the generation size is set to 15 packets for UDP flows and to 
the TCP window size for TCP flows. 

We compare our schemes (I 2 NC-state and I 2 NC-stateless) 
to no network coding {noNC), and COPE (2), in terms of total 
transport-level throughput (added over all flows). 

B. Simulation Results 

TCP Traffic. In Fig. [5] we present simulation results for two 
TCP flows in X topology shown in Fig. Ufa) to illustrate the 
key intuition of our approach. Consider, for the moment, that 
loss occurs only on one link, either (a) the overhearing link 
A\ - B 2 or (b) the direct link I - B 2 . 

The first case is depicted in Fig. 0a). Loss on the over- 
hearing link does not affect the uncoded streams, thus the 
throughput of TCP+noNC does not change with loss rate. 
When NC is employed, reports carrying information about 
overheard packets may be delivered late to intermediate node 
/. Thus, there are some instances that intermediate node should 
make a decision even if it does not have the exact knowledge. 
In this case, / makes a decision probabilistically. Specifically, 
if decoding probability exceeds some threshold (20% in our 
simulations), / codes packets. However, some of these packets 
may not be decodable at the receiver. It is why the performance 
of TCP+COPE and TCP+I 2 NC-state reduce with increasing 
loss rate and equals to the throughput of TCP+noNC after 
20% loss rate (NC is turned off after 20% loss rate). However, 
TCP+I 2 NC-state is still better than TCP+COPE, because when 
it makes probabilistic NC decision (when loss rate is less than 
20%), it adds redundancy considering the loss rate over the 
overhearing link. This improves throughput, because adding 
redundancy using intra-session NC makes all packets equally 
beneficial to the receiver and the probability of decoding 
inter-session network coded packets increases. TCP+I 2 NC- 
stateless outperforms other schemes over the entire loss range. 
For example, if there is no loss, I 2 NC-stateless still brings 
the benefit due to eliminating ACK packets and using less 
overhead to communicate information (i.e., COPE and I 2 NC- 
state exchanges the information about the overheard packets, 
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Fig. 4. Topologies under consideration, (a) X topology. Two unicast flows, S±, R\, and S2, R2, meeting at intermediate node /. (b) Cross topology. Four 
unicast flows, Si, Ri, S2, R2, S3, R3, and S4, R4, meeting at intermediate node /. (c) Wheel topology. Multiple unicast flows Si, Ri, S2, R2, etc., meeting 
at intermediate node I. In all three topologies, I opportunistically combine the packets and broadcast. 



TCP+™c-stateless 
TCP+l 2 NC-state 
-*-TCP+COPE 

-l-TCP+noNC 




20 30 40 

Average loss rate (%) 
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(b) Loss only on direct link 

Fig. 5. X topology in Fig. |4ja). We show the total TCP throughput (added 
over two flows) vs. link loss rate, for two specific loss patterns. Loss happens 
only on one link, either: (a) the overhearing link A\ — B2 or (b) the direct 
link I — £?2. All other links are lossless. 

while I 2 NC-stateless exchanges the information about the loss 
rates), thus using the medium more efficiently. When the loss 
rate increases, the improvement of I 2 NC-stateless becomes 
significant, reaching up to 30%. The reason is that at high loss 
rates, I 2 NC-state and COPE do not have reliable knowledge 
of the decoding buffers of their neighbors and cannot do 
NC efficiently. In contrast, I 2 NC-stateless does not rely on 
this information, but on the loss rate of the overhearing link 
to make NC decision. In the discussion of Example 1, we 
mentioned that at 50% loss rate, 16.6% improvement can be 
achieved via NC. Here, we see this improvement (13%) as 
well as the the additional benefit of eliminating ACK packets 
(12%). Note that the total improvement is 25%. 

The second case is depicted in Fig. 0b). The throughput of 



TCP+noNC decreases with increasing loss rate because, the 
loss is over the direct link and some packets whether they are 
coded or not are lost on the direct link (/ — B2). This leads 
to decrease in throughput level. TCP+I 2 NC-state outperforms 
TCP+COPE in this scenario, because I 2 NC-state corrects 
errors on the direct link thanks to the added redundancy 
which reduces the number of re-transmissions. Thus, I 2 NC- 
state uses the channel more efficiently than COPE and im- 
proves the throughput. Note that TCP+I 2 NC-state outperforms 
TCP+COPE even after 20% loss rate, although inter-session 
NC is turned off after this level. The reason is that although 
I 2 NC-state does not do inter-session NC after 20% loss rate, 
it keeps doing intra-session NC which adds redundancy to 
correct errors. Due to this property, TCP+I 2 NC-state out- 
performs TCP+COPE even at high loss rates. TCP+I 2 NC- 
stateless significantly outperforms all alternatives again due to 
performing NC at all loss rates and eliminating ACK packets. 

Fig. [6] presents simulation results for TCP traffic over X, 
cross, and the multi-hop topologies assuming loss on all links. 
For ease of presentation, here, we report only the results when 
all links have the same loss probability. 

Fig. |6ja) shows the results for the X topology. At low- 
medium loss rates (10% - 30%), I 2 NC-state and COPE are 
still able to do NC, so TCP+I 2 NC-state and TCP+COPE 
improve throughput significantly as compared to TCP+noNC. 
At higher loss rates, I 2 NC-state and COPE do not have reliable 
knowledge of the decoding buffers of their neighbors and 
cannot do NC efficiently. As a result, the improvement of 
TCP+I 2 NC-state and TCP+COPE as compared to TCP+noNC 
reduce with increasing loss rate. TCP+I 2 NC-state is better 
than TCP+COPE at higher loss rates thanks to its error 
correction mechanism. TCP+I 2 NCstateless outperforms other 
schemes over the entire loss range thanks to combining NC 
and error correction as well as eliminating ACKs. For example, 
if there is no loss, TCP+I 2 NC-stateless still brings the benefit 
by eliminating ACK packets, thus using the medium more 
efficiently. When the loss rate increases, the improvement 
of I 2 NC-stateless becomes significant, because I 2 NC-stateless 
does not rely on the knowledge of the decoding buffers of their 
neighbors, but only on the link loss rates for inter-session NC. 

Fig. |6jb) shows the results for the cross topology. The 
improvement of TCP+I 2 NC-stateless is higher as compared to 
the X topology, because there are more NC opportunities here 
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Fig. 6. Total TCP throughput vs. average loss rate (for ease of presentation, the same loss rate is assumed on all links) in three different topologies. 
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Fig. 7. Wheel topology shown in Fig. |4jc) with increasing number of flows. 
Loss rate on all links is set to 10%. 

for I 2 NC-stateless to exploit. We also performed simulations 
with increasing number of flows, (i.e., nodes in this topology); 
the details are provided later in this section. 

Fig. |6jc) presents the results for the multi-hop topology 
in Fig. Q] The improvement of TCP+I 2 NC-state is higher 
than in the X and cross topologies, especially at higher 
loss rates. This is because intra-session coding, employed 
by I 2 NC-state, reduces the dependency on link level ARQ. 
More specifically, in this multi-hop topology, the end-to-end 
residual loss rate increases with the number of hops. Intra- 
session NC overcomes this, thus increasing TCP throughput. 
The improvement of I 2 NC-stateless is even more significant 
for this topology, because the benefit of eliminating ACKs is 
more pronounced with larger number of hops. 

We also performed simulations with increasing number 
of flows, i.e., nodes in wheel topology in Fig. SJc). It is 
seen in Fig. [7] that the total throughput achieved by NC 
schemes increases with the increasing number of flows. When 
the number of flows increases, the probability of NC at the 
intermediate node / increases. More NC opportunities leads 
to higher throughput. 

UDP traffic. We repeated the simulations for the three 
topologies for the case that there is loss over all links. The 
results are presented in Fig. 

Fig. [HJa) presents the results for the X topology. The 
improvement of UDP+I 2 NC-stateless is up to 60% as com- 
pared to UDP+noNC. This is significantly higher than the 
improvement of TCP+I 2 NC-stateless and the optimal scheme 
(in which the improvement is 33.3%). The reason is the MAC 



gain as explained in |2|0 We present the results for the 
load at which the system saturates. At this load, UDP+noNC 
is already saturated, several packets are dropped from the 
buffers, and they do not arrive to their receivers. This reduces 
the throughput of noNC, while NC schemes still handle the 
traffic created by the load. Notice that even at 50% loss 
rate, UDP+I 2 NC-stateless improves over UDP+noNC by 40%, 
which is significant. 

Fig. |8jb) presents the results for the cross topology. In this 
topology, the improvement of NC is very large. When there is 
no loss, the improvement is around 250%. The effectiveness 
of UDP+I 2 NC-stateless is also significant in this topology: at 
50% loss rate the improvement of UDP+I 2 NC-stateless over 
UDP+noNC is 70%. 

Fig-EJc) presents the results for multi-hop topology. We see 
similar behavior as observed by Figs. Ob) and (c). However, 
the improvement of UDP+I 2 NC-stateless is larger in this 
topology, because the benefit of eliminating ACKs is more 
pronounced with larger number of hops. 

C. Numerical Results 

We consider the X and cross topologies shown in Figs. Ua) 
andHfb). In the X topology, A\ transmits packets to A 2 via / 
with rate x\, and B\ transmits packets to B2 via / with rate X2. 
In the cross topology, Ai transmits packets to A2 with rate x\, 
A2 transmits packets to A\ with rate X2, B\ transmits packets 
to £>2 with rate x$, and B2 transmits packets to B\ with rate 
.T4. All transmissions are via /. In both topologies, the data rate 
of each link is set to 1 packet/transmission. We compare our 
schemes I 2 NC-state and I 2 NC-stateless with noNC which is 
also formulated in a network utility maximization framework 
without any NC constraints. 

Fig. [9] shows the total throughput; x\ + X2 for X topology. 
Fig. [9ja) shows the results when there is loss on A\ — B2- 
It is seen that the throughput of noNC is flat with increasing 

12 The MAC gain observed with UDP flows when NC is used can be 
summarized as follows. When NC is employed, the coded wireless network 
can handle larger amount of load as compared to its uncoded counterpart. 
Therefore, when coded system saturates at some load level, uncoded system 
can not handle this level of load. Thus, several packets are dropped from 
output queues at each node in the system. Some of these packets may be 
dropped from intermediate packets. In this case, resources (bandwidth in 
our case) to transmit these packets (which will be eventually dropped) is 
wasted. Therefore, the gap between the achieved throughput level of coded 
and uncoded systems becomes significant. 




Fig. 8. Total UDP throughput vs. average loss rate (the same loss rate is assumed on all links) in three different topologies. 
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(c) Loss on A± — B2 and I — B2 (d) Loss on all links 

Fig. 9. X topology. Throughput vs. loss rate (the same loss rate is assumed 
on A\ — B2 and / — B2 in (c), and the same loss rate is assumed on all 
links in (d)). 

loss rate, because it is not affected by the loss rate on the 
overhearing link. I 2 NC-state and I 2 NC-stateless improve over 
noNC, because they exploit NC benefit. When the loss rate in- 
creases, the improvement reduces, because B2 overhears only 
part of the data transmitted by A\. Although the improvement 
decreases with increasing loss rate, it is still significant, e.g., 
16.6% at 50% loss rate. Note that Fig. [9ja) is the counterpart 
of the simulation results presented in Fig. HJa). It is seen that 
TCP+I 2 NC-stateless in Fig. |5ja) shows similar performance 
as I 2 NC-stateless in Fig. |9|a). This shows the effectiveness of 
I 2 NC-stateless in a realistic simulation environment. 

Fig. |9jb) shows the results when there is loss on I — B2. 
It is seen that I 2 NC-state and I 2 NC-stateless improve over 
noNC significantly at all loss rates. It is also interesting to note 
that at 50% loss rate, I 2 NC-state and I 2 NC-stateless improve 
over noNC by 44% which is even higher than in the no loss 
case (33%). The reason is in the following. In the optimal 
solution, the throughput values are x\ = 0.4 and X2 = 0.2. 
In this case, in the downlink I — Bi, data part of xn with 
rate 0.2 and the parity part with rate 0.2 (considering loss rate 
50%) are combined with x\. This means that our schemes 



combine both parity and data parts of a flow with other flows 
and this improves the throughput significantly. This is one of 
the important contributions of I 2 NC. 

Fig. |9jc) shows the results when there is loss on links 
A\ — B2 and I — Bi- It is seen that I 2 NC-state improves 
the throughput significantly while the improvement of I 2 NC- 
stateless reduces to with increasing loss rate. The reason 
is that, I 2 NC-stateless is a more conservative scheme as 
compared to I 2 NC-state in the sense that it eliminates the 
perfect knowledge on antidotes. Yet, it still improves the 
throughput significantly, e.g., it improves over noNC by 22% 
at 30% loss rate. 

Fig. |9jd) shows the results when there is loss on all links. It 
is seen that I 2 NC-state and I 2 NC-stateless improve over noNC 
significantly at all loss rates. Note that throughput of I 2 NC- 
stateless reduces to that of noNC at 50% loss rate in Fig. |9|c). 
The reader might wonder why we do not see such behavior in 
Fig-|9|d). The reason is that since there is loss over link A\ — I 
as well as A\ — B2, the number of parities added by A\ to 
correct losses over link A\ — I also increases the number of 
overheard packets at £>2- Therefore, I 2 NC-stateless does not 
add redundancy at node / for both A\ — B2 and I — B2 as in 
Fig. |9lc), but adds redundancy only for loss on link 1 — B%. 
This improves the performance of I 2 NC-stateless . Note that 
the counterpart of these results are presented in Fig. HJa). It is 
seen that the throughput improvement of I 2 NC-stateless over 
noNC at 50% loss rate is around 30% in Fig. |9jd). As com- 
pared to this, the improvement of TCP+I 2 NC-stateless over 
noNC is limited in Fig. |6ja), because, in simulations, the block 
size is limited and fixed, and the scheduling is not perfect (we 
consider IEEE 802.11). Yet, the throughput improvement of 
TCP+I 2 NC-stateless over noNC is around 20% in Fig. gta), 
which is significant. 

Fig. [lO] shows the total throughput; x\ + X2 + £3 + £4 for 
I 2 NC-state, I 2 NC-stateless and noNC for the cross topology 
shown in Fig. |4jb) for different loss patterns. It is seen that 
the results are similar to the ones in Fig. [9] One difference 
is that the throughput improvement of NC schemes is higher, 
i.e., up to 80%, because there are more NC opportunities in 
the cross topology. 

VII. Conclusion 

In this paper, we proposed I 2 NC: a one-hop intra- and inter- 
session network coding approach for wireless networks. I 2 NC 
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(c) Loss on A± — B2 and / — B2 (d) Loss on all links 

Fig. 10. Cross topology. Throughput vs. loss rate (the same loss rate is 
assumed on Ai — B2 and I — B2 in (c), and the same loss rate is assumed 
on all links in (d)). 

builds on and improves COPE in two aspects: it is resilient 
to loss and it does not need to rely on the exact knowledge 
of the state of the neighbors. Our design is grounded on a 
NUM formulation and its solution. Simulations in GloMoSim 
demonstrate significant throughput gain of our approach com- 
pared to no network coding and COPE. 
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Appendix A: Convergence Analysis 

In this section, we analyze the convergence of the distributed 
solution of the NUM problem, given in Section [IV] First, 
we provide a proof of convergence, and then present some 
numerical calculations to verify the convergence. 



A. Proof of Convergence 

Let us first consider the optimality conditions below. Note 
that x s , a h k , r h k , and q h k are the optimal values. 
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When we arrange the terms in the above inequality by 
adding and removing terms, we have; 
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Since the marginal utility U s (.) is a decreasing function, its 
inverse, i.e., the Eq. d22l i is less than 0. Due to the optimality 
condition in Eq. dT3b and Eq. ( TT3T >. Eq. d23l is less than 0. Due 
to the optimality condition in Eq. (TPfl) . Eq. d24l i is less than 
0. Due to the optimality condition in Eq. dT6b . Eq. dZSb is less 
than 0. Thus, V(q, r, a) < 0. This implies the convergence of 
our solutions, l36l . 1371 . 

B. Numerical Results 

We consider again the X and cross topologies shown in 
Figs. Ufa) and Efb). In the X topology, A\ transmits packets 
to A2 via / with rate x\, and B\ transmits packets to B2 via 
I with rate x^. In the cross topology, A\ transmits packets 
to A2 with rate xi, A2 transmits packets to A\ with rate X2, 
Bi transmits packets to B2 with rate xs, and B2 transmits 
packets to B\ with rate X4. All transmissions are via /. 
In both topologies, the data rate of each link is set to 1 
packet/transmission and the loss rate is set to 30%. 

In Figs. QT| and [12] we present the throughput vs. the 
iteration number for the X topology at different loss patterns 
for I 2 NC-state and I 2 NC-stateless, respectively. Each figure 
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Fig. 11. X topology. Convergence of x\, X2, and x\ + X2 for I NC-state. Loss rate 
is 30%. 
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Fig. 13. Cross topology. Convergence of x\, X2, X3, x& t and xi + x 2 + x% + X4 
for I 2 NC-state. Loss rate is 30%. 
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Fig. 12. X topology. Convergence of x\, x 2 , and x\ + x 2 for I NC-stateless. Loss 
rate is 30%. 



shows the convergence of x\, x-i, and x\+X2 to their optimum 
values. E.g., x\ + x-i converges to its optimum value 0.59 in 
Fig. fTTT c) and x\ + X2 converges to its optimum value 0.55 
in Fig. H21c). 

Fig. [13] and [H present the throughput vs. the iteration ^ Loss on links A i - B ? and 1 - B * ^ Loss on dl links 

number for the cross topology at different loss patterns for 

I 2 NC-State and I 2 NC-Stateless, respectively. We See similar R g- 14 - Cross topology. Convergence of x u x 3 , x 3 , Xi , and x x + x 2 + x 3 + X4 

for I - NC-stateless. Loss rate is 30%. 

convergence results. Specifically, each flow rate, xi, x%, X3, 
X4, and the total rate converge to their optimum values. 




